Statistical methods for linguistic research: Foundational Ideas - Part II

نویسندگان

  • Bruno Nicenboim
  • Shravan Vasishth
چکیده

We provide an introductory review of Bayesian data analytical methods, with a focus on applications for linguistics, psychology, psycholinguistics, and cognitive science. The empirically oriented researcher will benefit from making Bayesian methods part of their statistical toolkit due to the many advantages of this framework, among them easier interpretation of results relative to research hypotheses, and flexible model specification. We present an informal introduction to the foundational ideas behind Bayesian data analysis, using, as an example, a linear mixed models analysis of data from a typical psycholinguistics experiment. We discuss hypothesis testing using the Bayes factor, and model selection using cross-validation. We close with some examples illustrating the flexibility of model specification in the Bayesian framework. Suggestions for further reading are also provided.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Statistical Methods for Linguistic Research: Foundational Ideas - Part I

We present the fundamental ideas underlying statistical hypothesis testing using the frequentist framework. We start with a simple example that builds up the one-sample t-test from the beginning, explaining important concepts such as the sampling distribution of the sample mean, and the iid assumption. Then we examine the meaning of the p-value in detail, and discuss several important misconcep...

متن کامل

Descriptive and Foundational Aspects of Quantum Cognition

Quantum mechanics emerged as the result of a successful resolution of stringent empirical and profound conceptual conflicts within the development of atomic physics at the beginning of the last century. At first glance, it seems to be bizarre and even ridiculous to apply ideas of quantum physics in order to improve current psychological and linguistic or semantic ideas. However, a closer look s...

متن کامل

A Corpus of Online Discussions for Research into Linguistic Memes

We describe a 460-million word corpus of online discussions. The data are collected from public news websites and community-ofinterest Internet forums, and are designed to support research on the propagation of socially relevant ideas, a.k.a., “memes.” A structural and statistical description of the corpus is given, and the employed methods of website monitoring, collection, and extraction are ...

متن کامل

Thermodynamics of biological processes.

There is a long and rich tradition of using ideas from both equilibrium thermodynamics and its microscopic partner theory of equilibrium statistical mechanics. In this chapter, we provide some background on the origins of the seemingly unreasonable effectiveness of ideas from both thermodynamics and statistical mechanics in biology. After making a description of these foundational issues, we tu...

متن کامل

Statistical Machine Learning and Computational Biology

Statistical machine learning is a field that combines algorithmic ideas with foundational concepts from probability and statistics. This combination makes statistical machine learning an essential tool for computational biology, in part because probabilistic notions are inherent in biology (arising, e.g., via thermodynamics, recombination and germline mutation) and in part because of the incomp...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Language and Linguistics Compass

دوره 10  شماره 

صفحات  -

تاریخ انتشار 2016